Chenhang Cui

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

May 26, 2026

WildRoadBench: A Wild Aerial Road-Damage Grounding Benchmark for Vision-Language Models and Autonomous Agents

May 19, 2026

Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer

Feb 22, 2026

Transport and Merge: Cross-Architecture Merging for Large Language Models

Feb 05, 2026

Reliable and Responsible Foundation Models: A Comprehensive Survey

Feb 04, 2026

Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment

Feb 03, 2026

Self-Guard: Defending Large Reasoning Models via Enhanced Self-Reflection

Jan 31, 2026

Lingua-SafetyBench: A Benchmark for Safety Evaluation of Multilingual Vision-Language Models

Jan 30, 2026

Improving Alignment in LVLMs with Debiased Self-Judgment

Aug 28, 2025

VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization

Aug 07, 2025